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Abstract: 

We present a randomized method to approximate any vector v from some 
set T C M". The data one is given is the set T and k scalar products 
((Xj, t>))f^]^, where (Xj)^^^ are i.i.d. isotropic subgaussian random vectors in 
M", and k <^ n. We show that with high probability, any y E T for which 
((Xj, y))i=i is close to the data vector ((Xj, f will be a good approxima- 
tion of V, and that the degree of approximation is determined by a natural 
geometric parameter associated with the set T. 

We also investigate a random method to identify exactly any vector which 
has a relatively short support using linear subgaussian measurements as 
above. It turns out that our analysis, when applied to {—1, l}-valued vectors 
with i.i.d, symmetric entries, yields new information on the geometry of faces 
of random { — 1, l}-polytope; we show that a fc-dimensional random { — 1,1}- 
polytope with n vertices is m-neighborly for very large m < ck/ log{c'n/k). 

The proofs are based on new estimates on the behavior of the empirical 



process sup^g^ 



when F is a subset of the L2 sphere. 



The estimates are given in terms of the 72 functional with respect to the '?/'2 
metric on F, and hold both in exponential probability and in expectation. 
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Introduction 



The aim of this article is to investigate the hnear "approximate reconstruc- 
tion" problem in M". In such a problem, one is given a set T C and the 
goal is to be able to approximate any unknown f G T using random linear 
measurements. In other words, one is given the set of values 
where Xi,...,Xk are independent random vectors in selected according 
to some probability measure fi. Using this information (and the fact that 
the unknown vector v belongs to T) one has to produce, with very high 
probability with respect to fi^, some t & T, such that the Euclidean norm 
|t — i;| < e{k) for e{k) as small as possible. Of course, the random sampling 
method has to be "universal" in some sense and not tailored to a specific set 
T; and it is natural to expect that the degree of approximation e{k) depends 
on some geometric parameter associated with T. 

Questions of a similar flavor have been thoroughly studied in nonpara- 
metric statistics and statistical learning theory (see, for example, |BBLj and 
(Ml IMl] and references therein). This particular problem has been addressed 
by several authors (see jCTH ICT2| ICT3| IRVj for the most recent contribu- 
tions), in a rather restricted context. First of all, the sets considered were 
either the unit ball in or the unit balls in weak spaces for < p < 1 
- and the proofs of the approximation estimates depended on the choice of 
those particular sets. Second, the sampling process was done when Xi were 
distributed according to the Gaussian measure on M" or in jCTlj for Fourier 
ensemble. 

In contrast, we present a method which is very general. Our results 
hold for any set T C M", and the class of measures that could be used is 
broad; it contains all probability measures on which are isotropic and 
subgaussian, that is, satisfy that for every y G M", E| (X, y) p = and the 
random variables {X,y) are subgaussian with constant a\y\ for some a > 1. 
(see Definition 12.21 below). This class of measures contains, among others, 
the Gaussian measure on M", the uniform measure on the vertices of the 
combinatorial cube and the normalized volume measure on various convex, 
symmetric bodies (e.g. the unit balls of ip for 2 < p < oo). 

It turns out that the key parameter in the estimate on the degree of 
approximation e{k) is indeed geometric in nature. Moreover, the analysis of 
the approximation problem is centered around the way the random operator 
r = X]i=i (^«; ■) (where (ej)*L]^ is the standard basis in M'^) acts on subsets 
of the unit sphere. 
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Our geometric approach, when apphed to the sets T considered in |CTlj , 
yields the optimal estimates on e{k), and with better probability estimates 
(of the order of 1— exp(— cfc)), even when the sampling is done according to an 
arbitrary isotropic, subgaussian measure. Moreover, our result is more robust 
that the one from |CTlj in the following sense. The reconstruction method 
suggested in |CTlj is to find some y G T such that for every 1 < i < k, 
{Xi,v) = {Xi,y), and then to show that such v and y must necessarily be 
close. From our theorem it is clear that one can choose any y & T for which 
Yli=i i-^iy y ~ '^)^ relatively small, which is far more stable algorithmically. 

For the moment, let us present a simple version of the main result we 
prove in this direction, and to that end we require the following notation. 
Let be independent standard Gaussian random variables. Let T C M"' 

be a star-shaped set (i.e. for every t E T and < A < 1, At G T) and consider 
the following geometric parameter 



4(T) = Esup 



i=l 



which is, up to a factor of the order of 2^/n, the mean width of the body 
T. We now define a more sensitive parameter, which as we will see in this 
article, is the right parameter to control the error term e{k) for a star-shaped 
set T: 



r*(^,T) := infjp > : p > caH,{T n pS""'^ 



We may now state a version of our main result concerning approximate 
reconstruction. 

Theorem A There exist an absolute constant C\ for which the following 
holds. Let T be a star-shaped subset o/R". Let p be an isotropic, subgaussian 
measure with constant a on and set be independent, selected 

according to fi. Then, with probability at least 1 — exp(— cifc/a'*), every y,v & 
T satisfy that 



' k \ 1/2 

|y - t;| < 2 f i ^ ((X„ t;) - (X„ y)f + r*(l/2, T - T). 

> i=l J 



The parameter rl{6, T) can be estimated for unit ball of classical normed 
or quasi-normed spaces (see Sectional). In particular, if T = 5" then with 
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the same hypothesis and probabihty as above, one has 



\y - "I < 2 (i t ({X., v) - (A-,. y)A + ca^^Ei^^ 

where c > is an absolute constant; this leads to the optimal estimate for 
e{k) for that set. 

The main idea in the proof of this theorem is the fact that excluding a set 
with exponentially small probability, the random operator Yl^=i i-^iy ') 
a very good isomorphism on elements of T whose Euclidean norm is large 
enough (see Section |2l for more details). 

A question of a similar nature we investigate here focuses on "exact recon- 
struction" of vectors in R*^ that have a short support. Suppose that z G M"' is 
in the unit Euclidean ball, and has a relatively short support m << n. The 
aim is to use a random sampling procedure to identify z exactly, rather than 
just to approximate it. The motivation for this problem comes from error 
correcting codes, in which one has to overcome random noise that corrupts a 
part of a transmitted signal. The noise is modelled by adding to the trans- 
mitted vector the noise vector z. The assumption that the noise does not 
change many bits in the original signal implies that z has a short support, 
and thus, in order to correct the code, one has to identify the noise vector z 
exactly. Since error correcting codes are not the main focus of this article, 
we will not explore this topic further, but rather refer the interested reader 
to )MSt ICT2| IRVj and references therein for more information. 

In the geometric context we are interested in, the problem has been stud- 
ied in jnT2llRVl . where it was shown that if z has a short support relative to 
the dimension n and the size of the sample k, and if F is a x n matrix whose 
entries are independent, standard Gaussian variables, then with probability 
at least 1 — exp(— cfc), the minimization problem 

(P) min||f||£n for rv = rz, 

has a unique solution, which is z. Thus, solution to this minimization problem 
will pin-point the "noise vector" z. The idea of using this minimization 
problem was first suggested in |CDSj . 

We extend this result to a general random matrix whose rows are Xi, X^, 
sampled independently according to an isotropic, subgaussian measure. 
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Theorem B Let T be as above. With probability at least 1 — exp(— Ci k/a^), 
any vector z whose support has cardinality less than / \og{c^n / k) is the 
unique minimizer of the problem {P), where Ci, C2, C3 are absolute constants. 

Interestingly enough, the same analysis yields some information on the 
geometry of {—1,1} random polytopes. Indeed, consider the k x n matrix F 
whose entries are independent symmetric {—1,1} valued random variables. 
Thus, r is a random operator selected according to the uniform measure on 
the combinatorial cube { — 1, 1}", which is an isotropic, subgaussian measure 
with constant a = 1. The columns of F, denoted by vi, ...,Vn are vectors in 
{ — 1, l}'^ and let = conv(t>i, Vn) be the convex polytope generated by 
F; is thus called a random { — 1, l}-polytope. 

A convex polytope is called m-neighborly if any set of less than m of 
its vertices is the vertex set of a face (see 0). The following result yields 
the surprising fact that a random { — 1, l}-polytope is m-neighborly for a 
relatively large m. In particular, it will have the maximal number of r-faces 
for r < m. 

Theorem C There exist absolute constants Ci, C2, C3 for which the following 
holds. For 1 < k < n, with probability larger than 1 — exp(— ci/c), a k- 
dimensional random { — 1, 1} convex polytope with n vertices is m-neighborly 
provided that 

, Cafe 
~ log(c3n/A;) 

The main technical tool we use throughout this article, which is of in- 
dependent interest, is an estimate of the behavior of the supremum of the 
empirical process f ^ Zf = kr^ Yl\=i Pi^i) ~ 1 indexed by a subset of the 
L2 sphere. That is, 

where Xi, are independent, distributed according to the a probability 

measure /x, and under the assumption that every / G -F satisfies that E/^ = 1. 
We assume further that F is a bounded set with respect to the ip2 norm, 
defined for a random variable Y by 

= inf {m > : Eexp(rVw^) < 2} • 



sup 
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To formulate the following result, we require the notion of the 7^ func- 
tional |l'a2j ■ For a metric space {T,d), an admissible sequence of T is a 
collection of subsets of T, {Tg : s > 0}, such that for every s > 1, |Ts| = 2^" 
and |To| = 1. for p = 1,2, we define the 7^ functional by 

00 

7p(r,d) = infsupV2^/^ci(t,T,), 

where the infimum is taken with respect to all admissible sequences of T. 

Theorem D There exist absolute constants Ci, 02,03 and for which the fol- 
lowing holds. Let {Q, fi) be a probability space, set F be a subset of the unit 
sphere of L2{^) and assume that diam(F, || \\^^) = a. Then, for any 6 > 
and k > 1 satisfying 

Cia72 

with probability at least 1 — exp{—C20'^k/a'^), 

sup \ Zf\ < 9. 

Moreover, if F is symmetric, then 

TIT I v I ^ 272(-^, II 11^2) 

Esup |z/| < c^a _ ■ 

feF Vk 

Theorem D improves a result of a similar flavor from |KMj in two ways. 
First of all, the bound on the probability is exponential in the sample size k 
which was not the case in |KMj . Second, we were able to bound the expecta- 
tion of the supremum of the empirical process using only a 72 term. This fact 
is surprising because the expectation of the supremum of empirical processes 
is usually controlled by two terms; the first one bounds the subgaussian part 
of the process and is captured by the 72 functional with respect to the un- 
derlying metric. The other term is needed to control sub-exponential part 
of the empirical process, and is bounded by the 71 functional with respect 
to an appropriate metric (see |l'a2j for more information on the connections 
between the 7^ functionals and empirical processes). Theorem D shows that 
the expectation of the supremum of |Z/| behaves as if {Zf : / e F} were a 
subgaussian process with respect to the ip2 metric (although it is not), and 
this is due to the key fact that all the functions in F have the same second 
moment. 
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We end this introduction with the organization of the article. In Sec- 
tion n we present the proof of Theorem D and some of its corollaries we 
require. In Section |21 we illustrate these results in the case of linear processes 
which corresponds to linear measurements. In Section El we investigate the 
approximate reconstruction problem for a general set, and in Section |3] we 
present a proof for the exact reconstruction scheme, with its application to 
the geometry of random {—1, l}-polytopes. 

Throughout this article we will use letters such as c, ci, .. to denote abso- 
lute constants which may change depending on the context. We denote by 
{ei)f^i the canonical basis of M"", by |x| the Euclidean norm of a vector x 
and by B2 the Euclidean unit ball. We also denote by |/| the cardinality of 
a finite set /. 

Acknowledgement: A part of this work was done when the second and the 
third authors were visiting the Australian National University, Canberra; and 
when the first and the third authors were visiting Universite de Marne-la- 
Vallee. They wish to thank these institutions for their hospitality. 

1 Empirical Processes 

In this section we present some results in empirical processes that will be 
central to our analysis. All the results focus on understanding the process 
— ^Yli=i f'^i-^i) ~ where A; > 1 and Xi,...,Xk are independent 
random variables distributed according to a probability measure /z. In par- 
ticular, we investigate the behavior of sup^g^ \ Zf\ in terms of various metric 
structures on F, and under the key assumption that every f E F has the same 
second moment. The parameters involved are standard in generic chaining 
type estimates (see [ra2j for a comprehensive study of this topic). 

Recall that the ipp norm of a random variable X is defined as 

||X||^^ = inf {m > : Eexp{\X\P/uP) < 2}. 

It is standard to verify (see for example |VWj ) that if X has a bounded 
ifj2 norm, then it is subgaussian with parameter c||X||^2 for some absolute 
constant c. More generally, a bounded ipp norm implies that X has a tail 
behavior, P(|X| > u), of the type exp{—cuP/\\X\\^^). 

Our first fundamental ingredient is the well known Bernstein's inequality 
which we shall use in the form of a ipi estimates ( |VWj ) . 
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Lemma 1.1 Let Yi,...,Yk be independent random variables with zero mean 
such that for some b > and every i, ^Y^W^^ < b. Then, for any u > 0, 

^(l^E^^I > ^) ^ 2exp f -cA;min(^, ^) j , (1.1) 

i=i ^ ' 

where c> is an absolute constant. 

We will be interested in classes of functions F C iv2(/x) bounded in the 
^2-norm; we assume without loss of generality that F is symmetric and we let 
diam(F, || H^^) '■— 2supjg^ II / II ■02 ■ Additionally, in many technical arguments 
we shall often consider classes F C Sl^, where — {/ : ||/||l2 — 1} is the 
unit sphere in L2{fi). 

Let Xi, X2, ... be independent random variables distributed according to 
Ij,. Fix k > 1 and for / e F define the random variables Zf and Wf by 

1=1 \ i=l / 

The first lemma follows easily from Bernstein's inequahty. We state it 
in the form convenient for further use, and give a proof of one part, for 
completeness. 

Lemma 1.2 There exists an absolute constant ci > for which the following 
holds. Let F C Sl2, ol — diam(F, || H^j) and set k > 1. For every f,g & F 
and every u>2 we have 

^{Wf_g > u\\f - < 2exp(-ciA;«2). 

Also, for every u > 0, 

P(|Z/ - Zg\ >ua\\f- c/II^J < 2exp(-ciA;min(M,M^)), 

and 

> ua'^) < 2exp{-cikmm{u,u^)). 
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Proof. We show the standard proof of the first estimate. Other estimates 
are proved similarly (see e.g., |KMj . Lemma 3.2). 
Clearly, 

k 

= l^f2(f - afix.) = ^(f - anx,) = 11/ - g\\l. 
1=1 

Applying Bernstein's inequality it follows that for t > 0, 
P(|W/%-|l/-9lliJ >i) 
< 2 -p(-«=--(||(; (||(; J))- 

Since = \\h\\^^ for ever j function h, then letting t = {u"^ — l)\\f — g\\^^, 

p(w^/% > u'\\f-g\\l) 



< ^{Wf_^-\\f-g\\l>{u'-l)\\f-g\ 

< 2 exp {-ck min(uV2, , 



as promised. 



Now we return to one of the basic notions used in this paper, that of 
the 72-functional. Let (T, d) be a metric space. Recall that an admissible 
sequence of T is a collection of subsets of T, {Tg : s > 0}, such that for every 
s > 1, \Ts\ = 2^' and |To| = I. 

Definition 1.3 For a metric space {T,d) and p = 1,2, define 

oo 

^p{T,d) = infsnpy^r/Pd{t,Ts), 

where the infimum is taken with respect to all admissible sequences of T. 
In cases where the metric is clear from the context, we will denote the 7p 
functional by 'jp{T). 

Set TT^ : T — > Ts to be a metric projection function onto Tg, that is, for ev- 
ery t & T, TTs{t) is a nearest element to t in Tg with respect to the metric d. It 



9 



is easy to verify by the triangle inequality that for every admissible sequence 

and every t G T, Er=o 2'^'^(^^+i W> W) < (1 + 1/v^) Er=o 2^/'^(^' 
and that diam(T, d) < 272 (T, d). 

We say that a set F is star-shaped if the fact that f E F implies that 
Xf e F for every < A < 1. 

The next Theorem shows that excluding a set with exponentially small 
probability, Wj is close to being an isometry in the L2{fi) sense for functions 
in F that have a relatively large norm. 

Theorem 1.4 There exist absolute constants c, c > for which the following 
holds. Let F C L2{fi) be star-shaped, a = diam(F, || H^J ^'^^ k > 1. For any 
< 6 < 1, with probability at least 1 — exp{—c6'^k/a^), then for all f & F 
satisfying E/^ > r^(^)^, we have 

(1 - ^)E/2 < — ^ < (1 + 0)E/2, (1.2) 

rh 

where 

rm = rl(e,F) := inf{, > : , > .^ ^.(F n pg^. || \U,) y 

The two-sided inequality ()1.2p is intimately related to an estimate on 
supjg^ which, in turn, is based on two ingredients. The first one shows, 
in the language of the standard chaining approach, that one can control the 
"end parts" of all chains, its proof is essentially the same as Lemma 2.3 from 

jKMl. 

Lemma 1.5 There exists an absolute constant C for which the following 
holds. Let F C Sl.^, a = diam(F, || H^^) and k > 1. There is F' d F such 
that \F'\ < 4^^ and with probability at least 1 — exp(— /c), we have, for every 

feF, 

W^/-.,,(/)<C72(F,|| \U/Vk, (1.4) 
where i^F'if ) is a nearest point to f in F' with respect to the il)2 metric. 

Proof. Let {Fs : s > 0} be an "almost optimal" admissible sequence of F. 
Then for every f & F, 

oo 

^2^/2||7r,+i(/)-vr,(/)||^, <272(F,|| 

s=0 
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Let So be the minimal integer such that 2'^° > k, and let F' = Fg^. Then 
IF'I < 22^= = 4^=. Write 



/-^.o(/) = E(^^+i(/)-^«(/))- 

s=so 

Since W is sub-additive then 



(/)■ 



For any f & F, s > sq and ^ > 2, noting that 2* > k, it follows by Lemma IT!^ 
that 

P > e^||vr,+i(/) - 7r,(/)||^,,^ < 2exp(-cie'2^). (1.5) 

Since \Fs\ < there are at most 2^"^^ pairs of t^s+\[,!) and Tisif)- 
Thus, for every s > Sq, the probability of the event from (jl.Sp holding for 
some / G F is less than or equal to 2^°^^ ■ 2 exp(— ci,^^2'') < exp(2''+^ — c^^2''), 
which, for ^ > := max(4/yci, 2), does not exceed exp(— Ci,^^2'^~-'^). 

Combining these estimates together it follows that 

°° 9*/2 ( II II ^ 

s=so ^ ^ 

outside a set of probability 

oo 

^exp(-cie^2^-i) < exp(-cie^2^V4). 

s=so 

We complete the proof setting, for example ^ = max(^o, 2/ y/ci) and recalling 
that 2'^« > k. m 

Remark 1.6 The proof of the lemma shows that there exist absolute con- 
stants c', c" > such that for every ^ > c', 

P(^sup > e^^^^^^jy^j < exp(-crfc), 

a fact which will be used later. 
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The next lemma estimates the supremum supyg^/ \Zf\, where the supre- 
mum is taken over a subset F' of F of a relatively small cardinality, or in 
other words, over the "beginning part" of a chain. However, in order to get 
an exponential in k estimates on probability, we require a separate argu- 
ment (generic chaining) for the "middle part" of a chain while for the "very 
beginning" it is sufficient to use a standard concentration estimate. 

Lemma 1.7 There exist absolute constants C and d" > for which the 
following holds. Let F C and a = diam(F, || H^J. Let k > 1 and F' G F 
such that \F'\ < 4'^. Then for every w > 0, 

\ry \ ^ l2iF, II llv-a) , 2 

sup \Zf \ < Ca -^-^ h a w, 

f&F' \/k 

with probability larger than or equal to 1 — 3exp(— c'" k mm{w,w'^)). 

Proof. Let (Fs)^o be an almost optimal admissible sequence of F', set Sq to 
be the minimal integer such that 2'*'' > 2k and fix si < Sq to be determined 
later. Since |F'| < 4^, it follows that Fg = F' for every s > Sq, and that 

so 



+1 



By Lemma fl. 21 for every / G F', I < s < Sq and n > 0, 

< 2exp(-ci mm{{u^/¥Jk), {u^/¥Jkf)) 



P( IZ^^(^) - Z^,_,(/)| > ua 



< 2exp(-ci min(M,M^)2'' ^). 

(For the latter inequality observe that if s < sq then 2*/fc < 4, and thus 
min 

Taking u large enough (for example, u = 2^/ci will suffice) we may ensure 
that 

J2 22^^%xp(-ciM2^'2) < J2 exp(-2^+3) < exp(-2^0. 

s=si+l s=si+l 
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Therefore, since there are at most 2^"^^ possible pairs of 7!'s+i{f) and iTsif), 
there is a set of probability at most exp(— 2*^) such that outside this set we 
have 

sup < -7^ 2^2 / ||7r,+i(a;) -7r,(x)||^2 < 

Denote Fg^ by F" and observe that \F"\ < 2^"^ Thus the later estimate 
implies 

1 7 I . , 12{F) 
sup < ca — ■j= — h sup \Zg\. 
f&F' V A; geF" 

Applying Lemma fl .21 for every w > we get 

P (l^gl > a^w) < 2 exp(— CiA; min(w, w^)). 

Thus, given w > 0, choose si < so to be the largest integer such that 
2^^ < cikmm{w,w'^)/2. Therefore, outside a set of probability less than 
or equal to 2exp(— cifcmin(iy, w^)) < exp(— ciA;min(i(7, w^)/2) we have 
\Zg\ < a^w for all g G F" . To conclude, outside a set of probability 
3exp(— ciA;min(i(7, w^)/2), 

sup \Zf \ < c a — -= — \- a w, 
feF' y/k 

as required. ■ 

Proof of Theorem II. 4L Fix an arbitrary p > 0, and for the purpose of this 
proof we let F{p) = F/p n Sl^, where F/p = {f /p : f e F}. 

Our first and main aim is to estimate supj^p^^^-^ \Zf\ on a set of probability 
close to 1. 

Fix 'u,w > to be determined later. Let F' C F{p) be as Lemma ll.5| 
with < 4^ For every / G F{p) denote by 7r(/) = np'if) a closest point 
to / with respect to the ip2 metric on F{p). By writing / = (/ — 7r(/)) + 7r(/), 
it is evident that 

r2 



<^/-^{/)+2W^/-(/)W^.{/) + l^.{/)l 



and thus. 



sup \Zf \ < sup Wj-_^rj-^ + 2 sup sup Wg + sup \Zg\. (1.6) 

feF(p) /eF(p) /eF(p) g&F' gdF' 
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Applying Lemma ll.5l the first term in is estimated using the fact 
that 

sup < C ^= , 

with probabihty at least 1 — exp(— A;). 

For every / G Fi^p) and every -u > we have 

PjPF/ > Wua^} <^{W) > 1+ua'^} < P{|^/| > ua'^} 

and, by Lemma TTT^l the latter probability is at most 2exp(— c/cmin('u,'u^)), 
where c > is an absolute constant. 

Combining these two estimates with Lemma 11.71 and substituting into 
()1.6|) . supjg^(p) \Zf\ is upper bounded by 

C2l2{F{p),\\ ^ ^72(F(P)J \U ^ 

k Vk 

, ^„^ l2{F{p), II II^J , ,2„„ n 7^ 

V A; 

with probability at least 1 - 2e"^' - 2e-"'=™''("'"') - 3e-'='=™'^("''^'). 

Given < 6^ < 1 we want the condition supjgj7(^) |Zj| < 6* to be satisfied 
with probability close to 1. This can be achieved by imposing suitable con- 
ditions on the parameters involved. Namely, if m = 1/a^ < 1 and if p > 
and w > satisfy 

^^ 72(i^(p),|| lUJ < ^///^2^ < ^^_g) 

y/k 

where C = max(2 C,C"), then each of the last three terms in ()1.7|) is less 
than or equal to 6/4, and the first term is less than or equal to (^/4)^. 

In order to ensure that (jl.8p holds, we let w = min(l, ^/(4C""a^)) . The 
above discussion shows that as long as p satisfies 

^^^ 72(F(P)J lUJ ^ 

eVk ~ 

then supjg^(p) |Z/| < 6* on a set of measure larger than or equal to 1 — 
'j^-cke^/a^^ where c> is an absolute constant. Hence, whenever p satisfies 
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(USD then (HID holds for all / G F(p). Finally, note that j2{F{p), \\ H^J = 
(l/p)72(Fnp5'i2, II 11^2), and thus (jl.9|) is equivalent to the inequality in the 
definition of rl{9). 

To conclude the proof, for a fixed < 6 < 1 set r = rl{9), with c = 4(7 
being the constant from (jl.9p . Note that if Xi,...,Xk satisfy (|1.2p for all 
/ G F{r) then, since F is star-shaped, the homogeneity of this condition 
implies that the same holds for all / G -F with E/^ > r^, as claimed. ■ 

Let us note two consequences for the supremum of the process Zf, which 
is of independent interest. 

Corollary 1.8 There exist absolute constants C',c' > for which the fol- 
lowing holds. Let F C 5*^2; — diam(F, || W^^) and k > 1. With probability 
at least 1 — exp (— c'7|(F, || ||^2)/'^'^) '^'^^ ^^■^ 



sup \ Zf\ < C' a max! 

/6F V 



12{F, II ||v,J 72(^, II IIV2) 



Vk ' k 
Moreover, if F is symmetric, 

'72(F, II \U lliF, 



sup \ Zf\ < C' a max I - 



E 

/6F 



Proof. This follows from the proof of Theorem 11.41 with p = 1. More 
precisely, the first part is a direct consequence of ()1.7j) . 

For the "moreover part" first use p.fij) for expectations, estimate the 
middle term by Cauchy-Schwarz inequality and note that Wg < 1 + Zg for 
all g & F' to yield that in order to estimate Esupjg^ \ Zf\ it suffice to bound 

Esup|Z/| and K sup Wf_^rf\. 
f&F' feF 

For simplicity denote 72(-F, || H^a) t>y 72 and let us begin with the 
second term. Applying Remark [l. 61 and setting G = {f — 7r(/) : f E F} and 
u = c''y2{F)/Vk, where c' is the constant from the remark, we obtain 



00 



¥ (supW^ >t] dt <u'^ + P sup >t] dt 



/oo 
exp{—c"vk).dv, 
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where the last inequahty follows by changing the integration variable to t 
v^v. This implies that 



^^2 ^ ^/72 (-^> II II V'2 



for some absolute constant C > 1. 

Next, we have to bound E sup ^^pi \Zf\, and to that end we use Lemma ITTTI 
Setting u = 2Ca'y2{F)/\/k, and then changing the integration variable to 
t = u/2 + a'^w, it is evident that 



F [ sup \Zf\>t]dt<u+ / F [ sup\Zf\>t ] dt 
v/eF' / Ju \feF' 



poo 

,2 



= u + a / PI sup \Zf \ > — + a w \ dw 

Ju/2oP- \feF' 2 J 

POO 

<u + 3q^ / exp (— c"'/cmin(w^, w)) (iw. 

Ju/2a^ 

Changing variables in the last integral w = ru/2a'^ and using the fact that 
72 (-^) > 1 for a symmetric set F in the unit sphere, the last expression is 
bounded above by 

u + i3/2)uj\xp (^-c"'mm (^^, (^)')) dr = C'u, 

where C > is an absolute constant. ■ 



2 Subgaussian Operators 

We now illustrate the general result of Section[T]in the case of linear processes, 
which was the topic that motivated our study. The processes corresponds 
then to random matrices with rows distributed according to measures on 
satisfying some natural geometric conditions. Our result imply concentration 
estimates for related random subgaussian operators, which eventually lead 
to the desired reconstruction results for linear measurements for general sets. 

The fundamental result that allows us to pass from the purely metric 
statement of the previous section to the geometric result we present below 
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follows from Talagrand's lower bound on the expectation of the supremum of 
a Gaussian process in terms of 72 of the indexing set. To present our result, 
the starting point is the fundamental definition of the -functional (which 
is in fact the so-called ^-functional of a polar set). 

Definition 2.1 Let T C M" and let gi, ...,gn be independent standard Gaus- 
sian random variables. Denote by i^:{T) = ^sup^^rplY^^^iQitil, where t = 

There is a close connection between the i^,- and 72- functionals given by 
the majorizing measure Theorem. Let {Gt '■ t G T} be a Gaussian process 
indexed by a set T, and for every s,t & T, let d'^{s, t) = E|Gs — Gt\'^. Then 

C272(T, d) <E sup \Gt\ < C372(T, d), 

where C2, C3 > are absolute constants. The upper bound is due to Fernique 
jF] and the lower bound was established by Talagrand |Talj . The proof of 



both parts and the most recent survey on the topic can be found in [ra2j 
In particular, if T C M" and Gt = ^ Qiti, then d{s, t) = \s — t\, and thus 

C272(r,| |)<t(T)<C372(T,| I), (2.1) 

Definition 2.2 A probability measure fi on M" is called isotropic if for every 
y G M", E| {X,y) p = where X is distributed according to fi. 

A measure n satisfies a ip2 condition with a constant a if for every y G R"", 
II {X,y) 11^2 < a\y\. 

A subgaussian or ip2 operator is a random operator of the form 

k 

r = 5^ (X,, . ) e, (2.2) 
1=1 

where the Xj are distributed according to an isotropic ip2 measure. 

Perhaps the most important example of an isotropic ip2 probability mea- 
sure on R" with a bounded constant other than the Gaussian measure is the 
uniform measure on {—1, 1}*^. Naturally, if X is distributed according to a 
general isotropic %l)2 measure then the coordinates of X need no longer be 
independent. For example, the normalized Lebesgue measure on an appro- 
priate multiple of the unit ball in for 2 < p < cxd is an isotropic il)2 measure 
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with a constant independent of n and p. For more details on such measures 
see (MP] . 

For a set T C M" and p > let 

Tp = TnpS''-\ (2.3) 

The next result shows that given T C M", subgaussian operators are very 
close to being an isometry on the subset of elements of T which have a "large 
enough" norm. 

Theorem 2.3 There exist absolute constants c, c > for which the following 
holds. Let T C M" 6e a star-shaped set. Let fi be an isotropic il)2 probability 
measure with constant a > 1. Let k > 1, and Xi, . . . , be independent, 
distributed according to /i and define T by For < 9 < 1, with 

probability at least 1 — exp{—c6'^k/a'^), then for all x & T such that \x\ > 
rl{9), we have 

{l-0)\x\^<^-^<{l + 9)\x\\ (2.4) 
k 

where 

rliO) = rl{e,T) := inf{p > : p > caX{T,)/{eVk)]. (2.5) 

In particular, with the same probability, every x eT satisfies 

\x\^ < max|(l - e)-^\Tx\yk, r^(e)^}. 

Proof. We use Theorem 11.41 for the set of functions F consisting of linear 
functionals of the form / = fx = (-,3;), for x G T. By the isotropicity of 
II/IU2 = kl f = fx & F. Also, since p is 'ip2 with constant a then it 
follows by (EUD that for all p > 0, 

72(Fnp5'L2, II llv.2) < "72(i^n p^La, II lUa) < (tt/c2)4(Tp), 

as promised. ■ 

Remark 2.4 It is clear from the proof of Theorem 11.41 that the upper esti- 
mates in ()2.4|) hold for ^ > 1 as well, with appropriate probability estimates 
and a modified expression for in ()2.5p . Note that of course in this case the 
lower estimate in ()2.4|) became vacuous. The same remark is valid for the 
estimate in (jl.2|) as well. 
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The last result immediately leads to an estimate for the diameter of ran- 
dom sections of a set T in R", given by kernels of random operators F, and 
which is a V'2-counterpart of the main result from |PTlj (see also |PT2j ). 

Corollary 2.5 There exist absolute constants c, c' > for which the follow- 
ing holds. Let T C be a star-shaped set and let fi be an isotropic 1^2 
probability measure with constant a > 1. Set k > 1, put Xi, . . . ,Xk to be 
independent, distributed according to /i and define T by / li^.i^) . Then, with 
probability at least 1 — exp(— cfc/a^), 

diam(kerrnr) <rl{T), 

where 

rl = rl{T) := infjp > : p > ~d aH.{T,) / Vk]. (2.6) 
Moreover, with the same probability, diam(kerr fl T) < c! cP' i^,{T^ j . 

The Gaussian case (that is, when p is the standard Gaussian measure on 
M"), although not explicitly stated in |PTlj . follows immediately from the 
proof in that paper. The parameter inf{p > : £*(T fl pB^^ < Cp\/k} was 
introduced in |PT2j . 

A version of CoroUarv 12.51 for random ±1- vectors follows from the result 
in as observed in IMF]. 

Proof of Corollary 1231 Applying Theorem 12. HI with 9 = 1/2, say, we get 
that if X G T and |x| > r^(l/2) then Tx 7^ 0. Thus for x G ker Ffl T we have 

< ''"^(1/2) and the first conclusion follows by adjusting the constants. 

Observe that since the function £*(Tp)/p = £^(^{1/ p)TnS"'~^) is decreasing 
in p then actually satisfies the equality in the defining formula (|2.6|) . Com- 
bining this and the obvious estimate C.^{Tp) < i*{T), concludes the "more- 
over" part. ■ 

Finally, let us note a special case of Theorem 12. 31 for subsets of the sphere. 

Corollary 2.6 Let T C S^~^ and let p,, a, k, Xi, F and 9 be the same 
as in Theorem \2. 51 As long as k satisfies k > {c' a^/9'^)i^:{T)'^ , then with 
probability at least 1 — exp(— c6'^A;/a'*), for all x E T, 

Ft 

1-9 <^--^ < 1 + 9, (2.7) 
where c, c > are absolute constants. 
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Proof. Let c, c > be the constants from Theorem 12.31 Observe that the 
condition on k, with c' = c^, is equivalent to rl{6,T) < 1, where T = {Ax : 
xeT,0<A<l}. Then (ITTj) immediately follows from ((231). ■ 



3 Approximate reconstruction 

Next, we show how one can apply Theorem 12.31 to reconstruct any fixed 
V E T for any set T C M", where the data at hand are linear subgaussian 
measurements of the form 

The reconstruction algorithm we choose is as follows: for a fixed e > 0, 
find some t E T such that 

k \ 1/2 

^J2m,v)-{X,,t)f] <e. 

i=l / 

The fact that we only need to find t for which ((Xj, is close to ((Xj, f 
rather than equal to it, is very important algorithmically because it is a far 
simpler problem. 

Let us show why such an algorithm can be used to solve the approximate 
reconstruction problem. 

Consider T = {A(t — s) : t, s e T, < A < 1} and observe that by Theo- 
rem EIHl for every < ^ < 1, with high probability, every such t G T satisfies 
that 

\t-v\<^Q + ri{e,f). 

Hence, to bound the reconstruction error, one needs to estimate r^(6', T). Of 
course, if T happens to be convex and symmetric then T C 2T which is 
star-shaped and thus 

\t-v\<^^+rl{e,2T). 

In a more general case, when T is symmetric and quasi-convex with constant 
a > 1, (i.e., T + T C 2aT and T is star-shaped), then 

\t-v\<^ + rl{e,aT). 

Therefore, in the quasi-convex case, the ability to approximate any point 
in T using this kind of random sampling depends on the expectation of the 
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supremum of a Gaussian process indexed by the intersection of T and a 
sphere of a radius p as a function of the radius. For a general set T, the 
reconstruction error is controlled by the behavior of the expectation of the 
supremum of the Gaussian process indexed by the intersection of T with 
spheres of radius p, and this function of p is just the modulus of continuity 
of the Gaussian process indexed by the set {Xt : < A < 1, t E T} (i.e, 
the expectation of the supremum of the Gaussian process indexed by the set 
{X{t - s) : < A < 1, t, s G T, \t-s\= p}). 

The parameters rl{6,T) can be estimated for the unit ball of classical 
normed or quasi-normed spaces. The two example we consider here are the 
unit ball in denoted by 5", and the unit balls in the weak-^^ spaces 
C,oo for < j9 < 1, denoted by B^^^. Recall that B^^^ is the set of all 
X = {xi)^^i G such that the cardinality \{i : \xi\ > s}\ < for all s > 0, 
and observe that -Bp^oo is a quasi convex body with constant a = 2^/^. Let 
us mention that there is nothing "magical" about the examples we consider 
here. Those are simply the cases considered in |CT11 FnVj . 

In order to bound for these sets we shall use the approach from |GLMPj , 
and combine it with Theorem 12.31 to recover and extend the results from 

irmiRvj . 

Theorem 3.1 There is an absolute constant c for which the following holds. 
Let 1 < k < n and < 6 < 1, and set e > 0. Let p be an isotropic ip2 
probability measure on M" with constant a, and let Xi, . . . ,Xfc be indepen- 
dent, distributed according to fi. For any < p < 1, with probability at least 
l-exp{-ce'k/a'), tfv.y^Bl^ satisfy that (Eti {X,,v - yf /k)^'^ < e, 
then 

where C^fi = ca^ jQ"^ and c> is an absolute constant. 

If v,y G -B" satisfy the same assumption then with the same probability 
estimate, 



1-9 ' V k 
To prove Theorem 13.11 we require the following elementary fact. 
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Lemma 3.2 Let < p < 1 and 1 < m < n. Then, for every x G M", 



sup {x,z) <2p \y2^*i 



N 1/2 

2 



where p = (l/p—l) ^ m}/"^ ^1"^ and {x*)^^i is a non-increasing rearrange- 
ment of 

Moreover, 

1/2 



sup {x, z) < 2p \y 



*2 



i=l 



with p = Xj ^Jm. 



Proof. We will present a proof for the case < p < 1. The case of B"^ is 
similar. 

Recall a well known fact that for two sequences of positive numbers Oj, 6j 
such that fli > (22 > • • •) the sum ^ai6^(j) is maximal over all permutations 
TT of the index set, if 67^(1) > &7r(2) > • • •• It follows that, for any p > 0, m > 1 
and z e n pBl, 

/ - \ 

\j=l / j>r7i 



< 



>i=l 



By the definition of p, this completes the proof. ■ 
Consider the set of elements in the unit ball with "short support" , defined 

by 

?7^ = {xGS"-1:|{z:x, ^0}|<m}. 

Note that Lemma f3.2l combined with a duality argument implies that for 
every 1 < m < n and every / C {1, ...^n\ with |/| < m, 

/| Bl n S''-^ C 2 conv n (3.1) 
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The next step is to bound the expectation of the supremum of the Gaus- 
sian process indexed by Um- 

Lemma 3.3 There exist an absolute constant c such that for every 1 < m < 
n, 

£*(convf/m) < c\/ m \og{cn/m). 

Proof. Recall that for every 1 < m < n, there is a set A^, of cardinality at 
most 5™ such that C 2 conv (for example, a successive approximation 
shows that we may take as A^ an 1/2- net in -B™). Hence there is a subset of 
i?2 of cardinality at most 5™'(^) such that Um C 2convA. It is well known 
(see for example (LTj) that for every T C ' 



£,(convr) = L{T) < cv/log(|T|), 

and thus. 



fconvf/m) < cWloe 5 



n 
m 

from which the claim follows. ■ 

Finally, we are ready to estimate r'^(^, -Bp^oo) r^(^, -B"). 

Lemma 3.4 There exists an absolute constant c such that for any < p < 1 
and 1 < k < n, 

r:(»,B" )<c(l-lVVMc-V»^*)^■"-"^ 



1/2 



and 

Proof. Again, we present a proof for B^^, while the treatment of 5" is 
similar and thus omitted. 

Let < p < 1 and 1 < k < n, and set 1 < m < n to be determined later. 
Clearly, 

1/2 

^xf j = sup {x,y) , 
, i=i J y^^^ 
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and thus, by Lemma l!0 i^^B"^^^ H pB^) < 2p£, where p = {1/p- 
l)~^m^/^~^/^. From the definition of rl(6) in Theorem 12.31 it suffices to 
determine m (and thus p) such that 

which by Lemma comes to coP'^/'m log(cn/m) < Q^k for some other 
numerical constant c. It is standard to verify that the last inequality holds 
true provided that 

~ log (cnQ;*^/6'2A;) ' 

and thus 



Proof of Corollary 13.11 The proof follows immediately from Theorem 12.31 
and Lemma (3.41 ■ 



4 Exact reconstruction 

Let us consider the following problem from the error correcting code theory. 
A linear code is given by an x (n — fc) real matrix A. Thus, a vector 
X G M""'^ generates the vector Ax G M". Suppose that Ax is corrupted by a 
noise vector z G M" and the assumption we make is that z is sparse, that is, 
has a short support, which we denote by supp(2;)={i : 2;^ 7^ 0}. The problem 
is to reconstruct x from the data, which is the noisy output y = Ax + z. 

For this purpose, consider a k x n matrix F such that TA = 0. Thus 
Tz = Ty and correcting the noise is reduced to identifying the sparse vector 
z (rather than approximating it) from the data Tz - which is the problem we 
focus on here. 

In this context, a linear programming approach called the basis pursuit 
algorithm, was recently shown to be relevant for this goal [ CDS) . This method 
is the following minimization problem 

(P) min||t||£^, rt = rz 
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(and recall that the £i-norm is defined by ||t||^^ = ^"^^ \ti\ for any t = 

For the analysis of the reconstruction of sparse vectors by this basis pur- 
suit algorithm, we refer to |CDSj and the recent papers |CT2l ICT3j . 

In this section, we show that if F is an isotropic 1^2 matrix then with high 
probability, for any vector z whose support has size less than ^^^^^j^-^ (for 
some absolute constant C), the problem (P) above has a unique solution that 
is equal to z. It means that such random matrices can be used to reconstruct 
any sparse vector, as long as the size of the support is not too large. This 
extends the recent result proved in |CT2j and |RVj for Gaussian matrices. 

Theorem 4.1 There exist absolute constants c, C and c for which the follow- 
ing holds. Let fi be an isotropic 1^2 probability measure with constant a > 1. 
For 1 < k < n, set Xi, . . . to be independent, distributed according to jj, 
and let F = Yli=i (^«' ') Then with probability at least 1 — exp(— c/c/a^), 
any vector z satisfying 

I / \ I Ck 
|supp(z)| < 



\og{cna'^ / k) 
is the unique minimizer of the problem 

(P) min||t||£j, Tt = Tz. 

The proof of Theorem 14. II is based on a scheme of the proof of |(yT2j . but 
is simpler and more general, as it holds for an arbitrary isotropic ijj2 random 
matrix. 

Let us remark that problem (P) is equivalent to the following one 

in ^^\\y-MU. 

where TA = 0. Thus we obtain the reconstruction result: 

Corollary 4.2 Let A be a n x [n — k) matrix. Set T to be a k x n matrix 
that satisfies the conclusion of the previous Theorem with the constants c 
and C , and for which TA = 0. For any x G W^~^ and any y = Ax + z, if 
|supp(2)| < iog(^^/fc) ; then x is the unique minimizer of the problem 

min \\y - AtWe^. 
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The condition TA = means that the range of A is a subspace of the 
kernel of F. Due to the rotation invariance of a Gaussian matrix (from both 
sides), the range and the kernel are random elements of the Grassmann man- 
ifold of the corresponding dimensions. Therefore, random Gaussian matrices 
A and F satisfy the conclusion of Corollary 14.21 

4.1 Proof of Theorem 14.11 

As in |CT2j . the proof consists of finding a simple condition for a fixed matrix 
F to satisfy the conclusion of our Theorem. We then apply a result from the 
previous section to show that random matrices satisfy this condition. 

The first step is to provide some criteria which ensure that the problem 
(P) has a unique solution as specified in Theorem 14. II This convex optimiza- 
tion problem can be represented as a linear programming problem. Indeed, 
let z eW and set 

/+ = {i : Zi> 0}, r = {i : Zi < 0}, I = 1+ U 1. (4.1) 

Denote by C the cone of constraint 

i&I- i&I" 

corresponding to the £i-norm. 

Note that if |t| is small enough then = Xlie/+('^* + '^*)~Siei'-(^«"'" 

ti) + ^jg/c \ti\. Thus, the solution of (P) is unique and equals to z if and 
only if 

kerFnC = {0} (4.2) 

By the Hahn-Banach separation Theorem, the latter is equivalent to the 
existence of a linear form w G M" vanishing on ker F and positive on C \ {0}. 

After appropriate normalization, it is easy to check that such an w satisfies 
that w = Yl^=i^i-^iy i'^y^i) = 1 for ^ ^ {u!,ei) = —1 for all i G /~, 
and I {w, Cj) I < 1 for all i G P. Setting w = Ylt=i ^i^i noticing that 
{w,ei) = {w,Tei) we arrive at the following criterion. 

Lemma 4.3 Let T be a k x n matrix and 2; G M". With the notation (|^.i| ), 
the problem 

(P) min||t||£j, Tt = Tz 
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has a unique solution which equals to z, if and only if there exists w E M.^ 
such that 

Wi e /+ {w, Tei) = 1, yie r {w, Tci) = -1, Vi G r I {w, Tci) \ < 1. 

The second preliminary result we require follows from Corollary 12.61 and 
the estimates of the previous section. 

Theorem 4.4 There exist absolute constants c, C and c for which the follow- 
ing holds. Let fi, a, k and T be as in Theorem \4.1\ Then, for every < 6 < 1, 
with probability at least 1 —exp{—c9^k/a'^), every x G 2 conv H S"-^^ sat- 
isfies that 

{I - e)\x\' < < {1 + 9)\x\^ (4.3) 

provided that 

9'k/a' 

m < C- 



log {cna^/6'^k) 



Proof. Applying Corollary 12.61 to T = 2conyf/4m fl S*""^, we only have to 
check that k > {c' a'^/6'^)i^:{T)^, which from Lemma f3.3l reduces to verifying 
that k > {c'a^/6'^)cmlog{cn/m). The conclusion now follows from the same 
computation as in the proof of Lemma 13.41 

Proof of Theorem HUl Observe that if t G C n S*""^ then ||t||^^ < 
2 SiG/ 1^*1 — 2\/RT5 where / is the support of z. Hence, 



This inclusion and condition ()4.2j) clearly imply that if F does not vanish on 
any point of a/4|/| B" fl 5*"^^ then the solution of (P) is unique and equals 
to z. By i\3.1\i we have 



n— 1 



n C 2 conv n S' 

Therefore, if F does not vanish on any point of 2 conv U^m H 5*""^ then z 
is the unique solution of (P). Applying Theorem 14. 4[ the lower bound in 
fl4.3|) shows that indeed, F does not vanish on any point of the required set, 
provided that 

Ck 

m < — — ; 777T 

log{cna^/k) 

for some suitable constants c and C. ■ 
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4.2 The geometry of faces of random polytopes 

Next, we investigate the geometry of random polytopes. Let F be a A; x n 
isotropic ip2 matrix. For 1 < i < n let fj = F(ej) be the vector columns of 
the matrix F and set K~^(r) (resp. K{r)) to be the convex hull (resp., the 
symmetric convex hull) of these vectors. 

In this situation, the random model that makes sense is when X = (a;j)"^^, 
where ( independent, identically distributed random variables for 

which E|xip = 1 and ||a;j||^2 — is standard to verify that in this case 

X = {xi)f^i is an isotropic 1^2 vector with constant a, and moreover, each 
vertex of the polytope is given by Vi = {xij)^^i. 

A polytope is called m-neighborly if any set of less than m vertices is 
the vertex set of a face. In the symmetric setting, we will say that a sym- 
metric polytope is m-symmetric-neighborly if any set of less than m vertices 
containing no-opposite pairs, is the vertex set of a face. 

The condition of Lemma f4.3l mav be reformulated by saying that the set 
{vi : z G /+} U {—Vi : i G /"} is the vertex set of a face of the polytope 
K(r). Thus, the condition for the exact reconstruction using the basis pur- 
suit method for any vector z with | supp(2;)| < m may be reformulated as a 
geometric property of the polytope K(T) (see |CT2t iRVj ): namely, that for 
all disjoint subsets and I~ 0/ {1, . . . ,n} such that + < m, the 
set {vi : i e /^} U {—Vi : i G /"} is the vertex set of a face of the polytope 
K{r). That is, K{r) is m-symmetric-neighborly. A similar analysis may be 
done in the non-symmetric case, for K^{r), where now /~ is empty. 

Lemma 4.5 Let T, K{r) and i^"*'(F) be as above. Then the problem 

(P) min||t||£j. Ft = Fz 

has a unique solution which equals to z for any vector z (resp., z > 0) such 
that |supp(2;)| < m, if and only if K{V) (resp., K^{V)) is m-symmetric- 
neighborly (resp., m-neighborly). 

Applying Theorem 14.11 we obtain 

Theorem 4.6 There exist absolute constants c, C and c for which the follow- 
ing holds. Let fi be an isotropic ip2 probability measure with constant a > 1 
and let k and F be as above. Then, with probability at least 1 — exp(— c /c/a^), 
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the polytopes (F) and K{T) are m-neighborly and m- symmetric-neighborly, 
respectively, for every m satisfying 

Ck 

~ a"^ log(cna^//c) 

The statement of Theorem 14.61 for K(r) and for a Gaussian matrix T is 
the main result of |RVj . However, a striking fact is that the same results 
holds for a random { — 1, l}-matrix. In such K~^(T) is the convex hull 

of n random vertices of the discrete cube { — 1, +1}'^, also known as a random 
{ — 1, l}-polytope. With high probability, every (m — l)-dimensional face of 
i^'+(r) is a simplex and there are (^) such faces, for m < Ck/ log{cn/k). 

Remark 4.7 Let us mention some related results about random {—1, 1}- 
polytopes. A result of |BPj states that for such polytopes, the number of 
facets , which are the k — 1-dimensional faces, may be super-exponential 
in the dimension k, for an appropriate choice of the number n of vertices. 
Denote by fq{K~^{r)) the number of g-dimensional faces of the polytope 
i^+(r). The quantitative estimate in [BPJ was recently improved in |GGMj 
where it is shown that there are positive constants a, b such that for k"' < 
n < exp(bk), one has W.fk-i{K~^{T)) > (Inn/ In fc")'^/^. For lower dimensional 
faces a threshold of fq{K~^(T))/ {J^-^} was established in [Kj . 
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